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SEQUENCE LISTING 

<110> ADLER, JON ELLIOT 
LI, XIAODONG 
STASZEWSKI, LENA 
O ■ CONNELL , SHAWN 
ZOZULYA, SERGEY 

<120> T1R TASTE RECEPTORS AND GENES ENCODING SAME 

<130> 078003-0280681 

<140> 10/035,045 
<141> 2002-01-03 

<150> 60/259,227 
<151> 2001-01-03 

<150> 60/284,547 
<151> 2001-04-19 

<160> 24 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 876 
<212> DNA 

<213> Homo sapiens 
<400> 1 

agcctggcag tggcctcagg cagagtctga cgcgcacaaa ctttcaggcc caggaagcga 6 0 
ggacaccact ggggccccag ggtgtggcaa gtgaggatgg caagggtttt gctaaacaaa 12 0 
tcctctgccc gctccccgcc ccgggctcac tccatgtgag gdcccagtcg gggcagccac 180 
ctgccgtgcc tgttggaagt tgcctctgcc atgctgggcc ctgctgtcct gggcctcagc 240 
ctctgggctc tcctgcaccc tgggacgggg gccccattgt gcctgtcaca gcaacttagg 300 
atgaaggggg actacgtgct gggggggctg ttccccctgg gcgaggccga ggaggctggc 360 
ctccgcagcc ggacacggcc cagcagccct gtgtgcacca ggtacagagg tgggacggcc 42 0 
tgggtcgggg tcagggtgac caggtctggg gtgctcctga gctggggccg aggtggccat 480 
ctgcggttct gtgtggcccc aggttctcct caaacggcct gctctgggca ctggccatga 54 0 
aaatggccgt ggaggagatc aacaacaagt cggatctgct gcccgggctg cgcctgggct 600 
acgacctctt tgatacgtgc tcggagcctg tggtggccat gaagcccagc ctcatgttcc 660 
tggccaaggc aggcagccgc gacatcgccg cctactgcaa ctacacgcag taccagcccc 720 
gtgtgctggc tgtcatcggg ccccactcgt cagagctcgc catggtcacc ggcaagttct 780 
tcagcttctt cctcatgccc cagtggggcg ccccccacca tcacccaccc ccaaccaacc 840 
cctgccccgt gggagcccct tgtgtcagga gaatgc 876 



<210> 2 
<211> 2687 
<212> DNA 

<213> Homo sapiens 
<400> 2 

tacatgcacc ccacccagcc ctgccctggg agccctgtgt cagaagatgc tcttggcctt 60 
gcaggtcagc tacggtgcta gcatggagct gctgagcgcc cgggagacct tcccctcctt 12 0 
cttccgcacc gtgcccagcg accgtgtgca gctgacggcc gccgcggagc tgctgcagga 180 
gttcggctgg aactgggtgg ccgccctggg cagcgacgac gagtacggcc ggcagggcct 24 0 



* 



gagcatcttc tcggccctgg ccgcggcacg cggcatctgc atcgcgcacg agggcctggt 3 00 
gccgctgccc cgtgccgatg actcgcggct ggggaaggtg caggacgtcc tgcaccaggt 360 
gaaccagagc agcgtgcagg tggtgctgct gttcgcctcc gtgcacgccg cccacgccct 420 
cttcaactac agcatcagca gcaggctctc gcccaaggtg tgggtggcca gcgaggcctg 480 
gctgacctct gacctggtca tggggctgcc cggcatggcc cagatgggca cggtgcttgg 54 0 
cttcctccag aggggtgccc agctgcacga gttcccccag tacgtgaaga cgcacctggc 600 
cctggccacc gacccggcct tctgctctgc cctgggcgag agggagcagg gtctggagga 660 
ggacgtggtg ggccagcgct gcccgcagtg tgactgcatc acgctgcaga acgtgagcgc 72 0 
agggctaaat caccaccaga cgttctctgt ctacgcagct gtgtatagcg tggcccaggc 780 
cctgcacaac actcttcagt gcaacgcctc aggctgcccc gcgcaggacc ccgtgaagcc 840 
ctggcaggtg agcccgggag atgggggtgt gctgtcctct gcatgtgccc aggccaccag 900 
gcacggccac cacgcctgag ctggaggtgg ctggcggctc agccccgtcc cccgcccgca 960 
gctcctggag aacatgtaca acctgacctt ccacgtgggc gggctgccgc tgcggttcga 1020 
cagcagcgga aacgtggaca tggagtacga cctgaagctg tgggtgtggc agggctcagt 1080 
gcccaggctc cacgacgtgg gcaggttcaa cggcagcctc aggacagagc gcctgaagat 114 0 
ccgctggcac acgtctgaca accaggtgag gtgagggtgg gtgtgccagg cgtgcccgtg 12 00 
gtagcccccg cggcagggcg cagcctgggg gtgggggccg ttccagtctc ccgtgggcat 12 60 
gcccagccga gcagagccag accccaggcc tgtgcgcaga agcccgtgtc ccggtgctcg 1320 
cggcagtgcc aggagggcca ggtgcgccgg gtcaaggggt tccactcctg ctgctacgac 13 80 
tgtgtggact gcgaggcggg cagctaccgg caaaacccag gtgagccgcc ttcccggcag 144 0 
gcgggggtgg gaacgcagca ggggagggtc ctgccaagtc ctgactctga gaccagagcc 1500 
cacagggtac aagacgaaca cccagcgccc ttctcctctc tcacagacga catcgcctgc 1560 
accttttgtg gccaggatga gtggtccccg gagcgaagca cacgctgctt ccgccgcagg 1620 
tctcggttcc tggcatgggg cgagccggct gtgctgctgc tgctcctgct gctgagcctg 1680 
gcgctgggcc ttgtgctggc tgctttgggg ctgttcgttc accatcggga cagcccactg 1740 
gttcaggcct cgggggggcc cctggcctgc tttggcctgg tgtgcctggg cctggtctgc 1800 
ctcagcgtcc tcctgttccc tggccagccc agccctgccc gatgcctggc ccagcagccc 1860 
ttgtcccacc tcccgctcac gggctgcctg agcacactct tcctgcaggc ggccgagatc 1920 
ttcgtggagt cagaactgcc tctgagctgg gcagaccggc tgagtggctg cctgcggggg 1980 
ccctgggcct ggctggtggt gctgctggcc atgctggtgg aggtcgcact gtgcacctgg 204 0 
tacctggtgg ccttcccgcc ggaggtggtg acggactggc acatgctgcc cacggaggcg 2100 
ctggtgcact gccgcacacg ctcctgggtc agcttcggcc tagcgcacgc caccaatgcc 2160 
acgctggcct ttctctgctt cctgggcact ttcctggtgc ggagccagcc gggctgctac 2220 
aaccgtgccc gtggcctcac ctttgccatg ctggcctact tcatcacctg ggtctccttt 2280 
gtgcccctcc tggccaatgt gcaggtggtc ctcaggcccg ccgtgcagat gggcgccctc 2340 
ctgctctgtg tcctgggcat cctggctgcc ttccacctgc ccaggtgtta cctgctcatg 2400 
cggcagccag ggctcaacac ccccgagttc ttcctgggag ggggccctgg ggatgcccaa 2460 
ggccagaatg acgggaacac aggaaatcag gggaaacatg agtgacccaa ccctgtgatc 2 52 0 
tcagccccgg tgaacccaga cttagctgcg atccccccca agccagcaat gacccgtgtc 2580 
tcgctacaga gaccctcccg ctctaggttc tgaccccagg ttgtctcctg accctgaccc 2640 
cacagtgagc cctaggcctg gagcacgtgg acacccctgt gaccatc 2687 



<210> 3 

<211> 2559 

<212> DNA 

<213> Homo sapiens 

<400> 3 

atgctgggcc ctgctgtcct gggcctcagc ctctgggctc tcctgcaccc tgggacgggg 60 
gccccattgt gcctgtcaca gcaacttagg atgaaggggg actacgtgct gggggggctg 12 0 
ttccccctgg gcgaggccga ggaggctggc ctccgcagcc ggacacggcc cagcagccct 18 0 
gtgtgcacca ggttctcctc aaacggcctg ctctgggcac tggccatgaa aatggccgtg 24 0 
gaggagatca acaacaagtc ggatctgctg cccgggctgc gcctgggcta cgacctcttt 3 00 
gatacgtgct cggagcctgt ggtggccatg aagcccagcc tcatgttcct ggccaaggca 360 
ggcagccgcg acatcgccgc ctactgcaac tacacgcagt accagccccg tgtgctggct 420 
gtcatcgggc cccactcgtc agagctcgcc atggtcaccg gcaagttctt cagcttcttc 480 
ctcatgcccc aggtcagcta cggtgctagc atggagctgc tgagcgcccg ggagaccttc 54 0 



ccctccttct tccgcaccgt gcccagcgac cgtgtgcagc tgacggccgc cgcggagctg 600 
ctgcaggagt tcggctggaa ctgggtggcc gccctgggca gcgacgacga gtacggccgg 660 
cagggcctga gcatcttctc ggccctggcc gcggcacgcg gcatctgcat cgcgcacgag 72 0 
ggcctggtgc cgctgccccg tgccgatgac tcgcggctgg ggaaggtgca ggacgtcctg 780 
caccaggtga accagagcag cgtgcaggtg gtgctgctgt tcgcctccgt gcacgccgcc 84 0 
cacgccctct tcaactacag catcagcagc aggctctcgc ccaaggtgtg ggtggccagc 900 
gaggcctggc tgacctctga cctggtcatg gggctgcccg gcatggccca gatgggcacg 960 
gtgcttggct tcctccagag gggtgcccag ctgcacgagt tcccccagta cgtgaagacg 1020 
cacctggccc tggccaccga cccggccttc tgctctgccc tgggcgagag ggagcagggt 1080 
ctggaggagg acgtggtggg ccagcgctgc ccgcagtgtg actgcatcac gctgcagaac 1140 
gtgagcgcag ggctaaatca ccaccagacg ttctctgtct acgcagctgt gtatagcgtg 1200 
gcccaggccc tgcacaacac tcttcagtgc aacgcctcag gctgccccgc gcaggacccc 12 60 
gtgaagccct ggcagctcct ggagaacatg tacaacctga ccttccacgt gggcgggctg 1320 
ccgctgcggt tcgacagcag cggaaacgtg gacatggagt acgacctgaa gctgtgggtg 13 80 
tggcagggct cagtgcccag gctccacgac gtgggcaggt tcaacggcag cctcaggaca 144 0 
gagcgcctga agatccgctg gcacacgtct gacaaccaga agcccgtgtc ccggtgctcg 1500 
cggcagtgcc aggagggcca ggtgcgccgg gtcaaggggt tccactcctg ctgctacgac 1560 
tgtgtggact gcgaggcggg cagctaccgg caaaacccag acgacatcgc ctgcaccttt 1620 
tgtggccagg atgagtggtc cccggagcga agcacacgct gcttccgccg caggtctcgg 1680 
ttcctggcat ggggcgagcc ggctgtgctg ctgctgctcc tgctgctgag cctggcgctg 1740 
ggccttgtgc tggctgcttt ggggctgttc gttcaccatc gggacagccc actggttcag 18 00 
gcctcggggg ggcccctggc ctgctttggc ctggtgtgcc tgggcctggt ctgcctcagc 1860 
gtcctcctgt tccctggcca gcccagccct gcccgatgcc tggcccagca gcccttgtcc 1920 
cacctcccgc tcacgggctg cctgagcaca ctcttcctgc aggcggccga gatcttcgtg 1980 
gagtcagaac tgcctctgag ctgggcagac cggctgagtg gctgcctgcg ggggccctgg 2 04 0 
gcctggctgg tggtgctgct ggccatgctg gtggaggtcg cactgtgcac ctggtacctg 2100 
gtggccttcc cgccggaggt ggtgacggac tggcacatgc tgcccacgga ggcgctggtg 2160 
cactgccgca cacgctcctg ggtcagcttc ggcctagcgc acgccaccaa tgccacgctg 2220 
gcctttctct gcttcctggg cactttcctg gtgcggagcc agccgggctg ctacaaccgt 2280 
gcccgtggcc tcacctttgc catgctggcc tacttcatca cctgggtctc ctttgtgccc '2340 
ctcctggcca atgtgcaggt ggtcctcagg cccgccgtgc agatgggcgc cctcctgctc 24 00 
tgtgtcctgg gcatcctggc tgccttccac ctgcccaggt gttacctgct catgcggcag 2460 
ccagggctca acacccccga gttcttcctg ggagggggcc ctggggatgc ccaaggccag 2520 
aatgacggga acacaggaaa tcaggggaaa catgagtga 2559 



<210> 4 
<211> 852 
<212> PRT 

< 2 1 3 > Homo sapi ens 
<400> 4 

Met Leu Gly Pro Ala Val Leu Gly Leu Ser Leu Trp Ala Leu Leu His 
15 10 15 

Pro Gly Thr Gly Ala Pro Leu Cys Leu Ser Gin Gin Leu Arg Met Lys 

20 25 30 

Gly Asp Tyr Val Leu Gly Gly Leu Phe Pro Leu Gly Glu Ala Glu Glu 
35 40 45 

Ala Gly Leu Arg Ser Arg Thr Arg Pro Ser Ser Pro Val Cys Thr Arg 
50 55 60 



Phe Ser Ser Asn Gly Leu Leu Trp Ala Leu Ala Met Lys Met Ala Val 
65 70 75 80 



Glu Glu lie Asn Asn 

85 

Tyr Asp Leu Phe Asp 

100 

Ser Leu Met Phe Leu 
115 

Cys Asn Tyr Thr Gin 
130 

His Ser Ser Glu Leu 
145 

Leu Met Pro Gin Val 

165 

Arg Glu Thr Phe Pro 

180 

Gin Leu Thr Ala Ala 
195 

Val Ala Ala Leu Gly 
210 

lie Phe Ser Ala Leu 
225 

Gly Leu Val Pro Leu 

245 

Gin Asp Val Leu His 

260 

Leu Phe Ala Ser Val 
275 

Ser Ser Arg Leu Ser 
290 

Thr Ser Asp Leu Val 
305 

Val Leu Gly Phe Leu 

325 

Tyr Val Lys Thr His 

340 

Ala Leu Gly Glu Arg 
355 

Arg Cys Pro Gin Cys 
3 70 



4 

Lys Ser Asp Leu Leu Pro 

90 

Thr Cys Ser Glu Pro Val 

105 

Ala Lys Ala Gly Ser Arg 
120 

Tyr Gin Pro Arg Val Leu 
135 

Ala Met Val Thr Gly Lys 
150 155 

Ser Tyr Gly Ala Ser Met 

170 

Ser Phe Phe Arg Thr Val 

185 

Ala Glu Leu Leu Gin Glu 
200 

Ser Asp Asp Glu Tyr Gly 
215 

Ala Ala Ala Arg Gly lie 
230 235 

Pro Arg Ala Asp Asp Ser 

250 

Gin Val Asn Gin Ser Ser 

265 

His Ala Ala His Ala Leu 
280 

Pro Lys Val Trp Val Ala 
295 

Met Gly Leu Pro Gly Met 
310 315 

Gin Arg Gly Ala Gin Leu 

330 

Leu Ala Leu Ala Thr Asp 

345 

Glu Gin Gly Leu Glu Glu 
360 

Asp Cys lie Thr Leu Gin 
375 



Gly Leu Arg Leu Gly 

95 

Val Ala Met Lys Pro 
110 

Asp lie Ala Ala Tyr 
125 

Ala Val lie Gly Pro 
140 

Phe Phe Ser Phe Phe 

160 

Glu Leu Leu Ser Ala 

175 

Pro Ser Asp Arg Val 
190 

Phe Gly Trp Asn Trp 
2 05 

Arg Gin Gly Leu Ser 
220 

Cys lie Ala His Glu 

240 

Arg Leu Gly Lys Val 

255 

Val Gin Val Val Leu 
270 

Phe Asn Tyr Ser lie 
285 

Ser Glu Ala Trp Leu 
300 

Ala Gin Met Gly Thr 

320 

His Glu Phe Pro Gin 

335 

Pro Ala Phe Cys Ser 
350 

Asp Val Val Gly Gin 
365 

Asn Val Ser Ala Gly 
380 



♦ 



Leu Asn His His Gin Thr Phe Ser Val Tyr Ala Ala Val Tyr Ser Val 
385 390 395 400 

Ala Gin Ala Leu His Asn Thr Leu Gin Cys Asn Ala Ser Gly Cys Pro 

405 410 415 

Ala Gin Asp Pro Val Lys Pro Trp Gin Leu Leu Glu Asn Met Tyr Asn 

420 425 430 

Leu Thr Phe His Val Gly Gly Leu Pro Leu Arg Phe Asp Ser Ser Gly 
435 440 445 

Asn Val Asp Met Glu Tyr Asp Leu Lys Leu Trp Val Trp Gin Gly Ser 
450 455 460 

Val Pro Arg Leu His Asp Val Gly Arg Phe Asn Gly Ser Leu Arg Thr 
465 470 475 480 

Glu Arg Leu Lys lie Arg Trp His Thr Ser Asp Asn Gin Lys Pro Val 

485 490 495 

Ser Arg Cys Ser Arg Gin Cys Gin Glu Gly Gin Val Arg Arg Val Lys 

500 505 510 

Gly Phe His Ser Cys Cys Tyr Asp Cys Val Asp Cys Glu Ala Gly Ser 
515 520 525 

Tyr Arg Gin Asn Pro Asp Asp lie Ala Cys Thr Phe Cys Gly Gin Asp 
530 535 540 

Glu Trp Ser Pro Glu Arg Ser Thr Arg Cys Phe Arg Arg Arg Ser Arg 
545 550 ~ 555 560 

Phe Leu Ala Trp Gly Glu Pro Ala Val Leu Leu Leu Leu Leu Leu Leu 

565 570 575 

Ser Leu Ala Leu Gly Leu Val Leu Ala Ala Leu Gly Leu Phe Val His 

580 585 590 

His Arg Asp Ser Pro Leu Val Gin Ala Ser Gly Gly Pro Leu Ala Cys 
595 600 605 

Phe Gly Leu Val Cys Leu Gly Leu Val Cys Leu Ser Val Leu Leu Phe 
610 615 620 

Pro Gly Gin Pro Ser Pro Ala Arg Cys Leu Ala Gin Gin Pro Leu Ser 
625 630 635 640 

His Leu Pro Leu Thr Gly Cys Leu Ser Thr Leu Phe Leu Gin Ala Ala 

645 650 655 

Glu lie Phe Val Glu Ser Glu Leu Pro Leu Ser Trp Ala Asp Arg Leu 

660 665 670 



Ser Gly Cys Leu Arg Gly Pro Trp Ala Trp Leu Val Val Leu Leu Ala 
675 680 685 



Met Leu Val Glu 
690 

Pro Glu Val Val 
705 

His Cys Arg Thr 



Asn Ala Thr Leu 

740 

Ser Gin Pro Gly 
755 

Leu Ala Tyr Phe 
770 

Val Gin Val Val 
785 

Cys Val Leu Gly 



Leu Met Arg Gin 

820 

Gly Pro Gly Asp 
835 

Gly Lys His Glu 
850 



Val Ala Leu Cys 
695 

Thr Asp Trp His 
710 

Arg Ser Trp Val 
725 

Ala Phe Leu Cys 



Cys Tyr Asn Arg 

760 

lie Thr Trp Val 
775 

Leu Arg Pro Ala 
790 

lie Leu Ala Ala 
805 

Pro Gly Leu Asn 



Ala Gin Gly Gin 

840 



Thr Trp Tyr Leu 

700 

Met Leu Pro Thr 
715 

Ser Phe Gly Leu 
730 

Phe Leu Gly Thr 
745 

Ala Arg Gly Leu 



Ser Phe Val Pro 

780 

Val Gin Met Gly 
795 

Phe His Leu Pro 
810 

Thr Pro Glu Phe 
825 

Asn Asp Gly Asn 



Val Ala Phe Pro 



Glu Ala Leu Val 

720 

Ala His Ala Thr 
735 

Phe Leu Val Arg 
750 

Thr Phe Ala Met 
765 

Leu Leu Ala Asn 



Ala Leu Leu Leu 

800 

Arg Cys Tyr Leu 
815 

Phe Leu Gly Gly 
830 

Thr Gly Asn Gin 
845 



<210> 5 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<220> 

<221> modif ied_base 
<222> (3) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (9) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif iedjbase 
<222> (12) 

<22 3> a, t, c, g, other or unknown 




<220> 

<221> modif ied_base 
<222> (18) 

<223> a, t, c, g, other or unknown 
<400> 5 

cgnttyytng cntggggnga rcc 2 3 



<210> 6 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<220> 

<221> modif ied__base 
<222> (3) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (6) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (18) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (21) 

<223> a, t, c, g, other or unknown 
<400> 6 

cgngcncgrt trtarcancc ngg 2 3 



<210> 7 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Arg Phe Leu Ala Trp Gly Glu Pro Ala 
1 5 



<210> 8 
<211> 8 
<212> PRT 

<213> Homo sapiens 



8 

<400> 8 

Pro Gly Cys Tyr Asn Arg Ala Arg 
1 5 



<210> 9 
<211> 552 
<212> DNA 
<213> Mus sp. 

<400> 9 

gtgctgtcac tcctcctgct gctttgcctg gtgctgggtc tagcactggc tgctctgggg 60 
ctctctgtcc accactggga cagccctctt gtccaggcct caggcggctc acagttctgc 120 
tttggcctga tctgcctagg cctcttctgc ctcagtgtcc ttctgttccc aggacggcca 180 
agctctgcca gctgccttgc acaacaacca atggctcacc tccctctcac aggctgcctg 240 
agcacactct tcctgcaagc agctgagacc tttgtggagt ctgagctgcc actgagctgg 3 00 
gcaaactggc tatgcagcta ccttcgggac tctggcctgc tagtggtact gttggccact 360 
tttgtggagg cagcactatg tgcctggtat ttgaccgctt caccagaagt ggtgacagac 42 0 
tggtcagtgc tgcccacaga ggtactggag cactgccacg tgcgttcctg ggtcaacctg 480 
ggcttggtgc acatcaccaa tgcaatggta gcttttctct gctttctggg cactttcctg 540 
gtacaagacc ag 552 



<210> 10 
<211> 184 
<212> PRT 
<213> Mus sp. 

<400> 10 
Val Leu Ser Leu 
1 

Ala Ala Leu Gly 

20 

Ala Ser Gly Gly 
35 

Phe Cys Leu Ser 
50 

Cys Leu Ala Gin 
65 

Ser Thr Leu Phe 



Pro Leu Ser Trp 

100 

Leu Leu Val Val 
115 

Trp Tyr Leu Thr 
130 



Leu Leu Leu Leu 

5 

Leu Ser Val His 



Ser Gin Phe Cys 

40 

Val Leu Leu Phe 
55 

Gin Pro Met Ala 
70 

Leu Gin Ala Ala 
85 

Ala Asn Trp Leu 



Leu Leu Ala Thr 

120 

Ala Ser Pro Glu 
135 



Cys Leu Val Leu 
10 

His Trp Asp Ser 
25 

Phe Gly Leu lie 



Pro Gly Arg Pro 

60 

His Leu Pro Leu 
75 

Glu Thr Phe Val 
90 

Cys Ser Tyr Leu 
105 

Phe Val Glu Ala 



Val Val Thr Asp 

140 



Gly Leu Ala Leu 
15 

Pro Leu Val Gin 
30 

Cys Leu Gly Leu 
45 

Ser Ser Ala Ser 



Thr Gly Cys Leu 

80 

Glu Ser Glu Leu 
95 

Arg Asp Ser Gly 
110 

Ala Leu Cys Ala 
125 

Trp Ser Val Leu 



Pro Thr Glu Val Leu Glu His Cys His Val Arg Ser Trp Val Asn Leu 
145 150 155 160 



* 



9 



Gly Leu Val His lie Thr Asn Ala Met Val Ala Phe Leu Cys Phe Leu 

165 170 175 

Gly Thr Phe Leu Val Gin Asp Gin 

180 



<210> 11 
<211> 558 
<212> DNA 
<213> Rattus sp. 

<400> 11 

gtgctgtcac ttctcctgct gctttgcctg gtgctgggcc tgacactggc tgccctgggg 60 

ctctttgtcc actactggga cagccctctt gttcaggcct caggtgggtc actgttctgc 120 

tttggcctga tctgcctagg cctcttctgc ctcagtgtcc ttctgttccc aggacgacca 180 

cgctctgcca gctgccttgc ccaacaacca atggctcacc tccctctcac aggctgcctg 240 

agcacactct tcctgcaagc agccgagatc tttgtggagt ctgagctgcc actgagttgg 3 00 

gcaaactggc tctgcagcta ccttcggggc ccctgggctt ggctggtggt actgctggcc 3 60 

actcttgtgg aggctgcact atgtgcctgg tacttgatgg ctttccctcc agaggtggtg 420 

acagattggc aggtgctgcc cacggaggta ctggaacact gccgcatgcg ttcctgggtc 480 

agcctgggct tggtgcacat caccaatgca ggggtagctt tcctctgctt tctgggcact 540 
ttcctggtac aaagccag 558 



<210> 12 
<211> 186 
<212> PRT 
<213> Rattus sp 

<400> 12 
Val Leu Ser Leu 
1 

Ala Ala Leu Gly 

20 

Ala Ser Gly Gly 
35 

Phe Cys Leu Ser 
50 

Cys Leu Ala Gin 
65 

Ser Thr Leu Phe 



Pro Leu Ser Trp 

100 

Ala Trp Leu Val 
115 

Ala Trp Tyr Leu 
130 



Leu Leu Leu Leu 
5 

Leu Phe Val His 



Ser Leu Phe Cys 

40 

Val Leu Leu Phe 
55 

Gin Pro Met Ala 
70 

Leu Gin Ala Ala 
85 

■ 

Ala Asn Trp Leu 



Val Leu Leu Ala 

120 

Met Ala Phe Pro 
135 



Cys Leu Val Leu 
10 

Tyr Trp Asp Ser 
25 

Phe Gly Leu lie 



Pro Gly Arg Pro 

60 

His Leu Pro Leu 
75 

Glu lie Phe Val 
90 

Cys Ser Tyr Leu 
105 

Thr Leu Val Glu 



Pro Glu Val Val 

140 



Gly Leu Thr Leu 
15 

Pro Leu Val Gin 
30 

Cys Leu Gly Leu 
45 

Arg Ser Ala Ser 



Thr Gly Cys Leu 

80 

Glu Ser Glu Leu 
95 

Arg Gly Pro Trp 
110 

Ala Ala Leu Cys 
125 

Thr Asp Trp Gin 
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Val Leu Pro Thr Glu Val Leu Glu 
145 150 

Ser Leu Gly Leu Val His lie Thr 

165 

Phe Leu Gly Thr Phe Leu Val Gin 

180 



His Cys Arg Met Arg Ser Trp Val 
155 160 

Asn Ala Gly Val Ala Phe Leu Cys 
170 175 

Ser Gin 
185 



<210> 13 

<211> 2577 

<212> DNA 

<213> Rattus sp. 

<400> 13 

atgccgggtt tggctatctt gggcctcagt ctggctgctt tcctggagct tgggatgggg 60 
tcctctttgt gtctgtcaca gcaattcaag gcacaagggg actatatatt gggtggacta 120 
tttcccctgg gcacaactga ggaggccact ctcaaccaga gaacacagcc caacggcatc 180 
ctatgtacca ggttctcgcc ccttggtttg ttcctggcca tggctatgaa gatggctgta 240 
gaggagatca acaatggatc tgccttgctc cctgggctgc gactgggcta tgacctgttt 300 
gacacatgct cagagccagt ggtcaccatg aagcccagcc tcatgttcat ggccaaggtg 360 
ggaagtcaaa gcattgctgc ctactgcaac tacacacagt accaaccccg tgtgctggct 420 
gtcattggtc cccactcatc agagcttgcc ctcattacag gcaagttctt cagcttcttc 480 
ctcatgccac aggtcagcta tagtgccagc atggatcggc taagtgaccg ggaaacattt 54 0 
ccatccttct tccgcacagt gcccagtgac cgggtgcagc tgcaggccgt tgtgacactg 600 
ttgcagaatt tcagctggaa ctgggtggct gccttaggta gtgatgatga ctatggccgg 660 
gaaggtctga gcatcttttc tggtctggcc aactcacgag gtatctgcat tgcacacgag 720 
ggcctggtgc cacaacatga cactagtggc caacaattgg gcaaggtggt ggatgtgcta 780 
cgccaagtga accaaagcaa agtacaggtg gtggtgctgt ttgcatctgc ccgtgctgtc 840 
tactcccttt ttagctacag catccttcat gacctctcac ccaaggtatg ggtggccagt 900 
gagtcctggc tgacctctga cctggtcatg acacttccca atattgcccg tgtgggcact 960 
gttcttgggt ttctgcagcg cggtgcccta ctgcctgaat tttcccatta tgtggagact 1020 
cgccttgccc tagctgctga cccaacattc tgtgcctccc tgaaagctga gttggatctg 1080 
gaggagcgcg tgatggggcc acgctgttca caatgtgact acatcatgct acagaacctg 1140 
tcatctgggc tgatgcagaa cctatcagct gggcagttgc accaccaaat atttgcaacc 12 00 
tatgcagctg tgtacagtgt ggctcaggcc cttcacaaca ccctgcagtg caatgtctca 1260 
cattgccaca catcagagcc tgttcaaccc tggcagctcc tggagaacat gtacaatatg 1320 
agtttccgtg ctcgagactt gacactgcag tttgatgcca aagggagtgt agacatggaa 13 80 
tatgacctga agatgtgggt gtggcagagc cctacacctg tactacatac tgtaggcacc 1440 
ttcaacggca cccttcagct gcagcactcg aaaatgtatt ggccaggcaa ccaggtgcca 1500 
gtctcccagt gctcccggca gtgcaaagat ggccaggtgc gcagagtaaa gggctttcat 1560 
tcctgctgct atgactgtgt ggactgcaag gcagggagct accggaagca tccagatgac 1620 
ttcacctgta ctccatgtgg caaggatcag tggtccccag aaaaaagcac aacctgctta 1680 
cctcgcaggc ccaagtttct ggcttggggg gagccagctg tgctgtcact tctcctgctg 1740 
ctttgcctgg tgctgggcct gacactggct gccctggggc tctttgtcca ctactgggac 1800 
agccctcttg ttcaggcctc aggtgggtca ctgttctgct ttggcctgat ctgcctaggc 1860 
ctcttctgcc tcagtgtcct tctgttccca ggacgaccac gctctgccag ctgccttgcc 1920 
caacaaccaa tggctcacct ccctctcaca ggctgcctga gcacactctt cctgcaagca 1980 
gccgagatct ttgtggagtc tgagctgcca ctgagttggg caaactggct ctgcagctac 2040 
cttcggggcc cctgggcttg gctggtggta ctgctggcca ctcttgtgga ggctgcacta 2100 
tgtgcctggt acttgatggc tttccctcca gaggtggtga cagattggca ggtgctgccc 2160 
acggaggtac tggaacactg ccgcatgcgt tcctgggtca gcctgggctt ggtgcacatc 222 0 
accaatgcag tgttagcttt cctctgcttt ctgggcactt tcctggtaca gagccagcct 2280 
ggtcgctata accgtgcccg tggcctcacc ttcgccatgc tagcttattt catcatctgg 2340 
gtctcttttg tgcccctcct ggctaatgtg caggtggcct accagccagc tgtgcagatg 2400 
ggtgctatct tattctgtgc cctgggcatc ctggccacct tccacctgcc caaatgctat 2460 
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gtacttctgt ggctgccaga gctcaacacc caggagttct tcctgggaag gagccccaag 2520 
gaagcatcag atgggaatag tggtagtagt gaggcaactc ggggacacag tgaatga 2577 



<210> 14 

<211> 858 

<212> PRT 

<213> Rattus sp. 

<400> 14 

Met Pro Gly Leu Ala lie Leu Gly Leu Ser Leu Ala Ala Phe Leu Glu 
15 10 15 

Leu Gly Met Gly Ser Ser Leu Cys Leu Ser Gin Gin Phe Lys Ala Gin 

20 25 30 

Gly Asp Tyr lie Leu Gly Gly Leu Phe Pro Leu Gly Thr Thr Glu Glu 
35 40 45 

Ala Thr Leu Asn Gin Arg Thr Gin Pro Asn Gly lie Leu Cys Thr Arg 
50 55 60 

Phe Ser Pro Leu Gly Leu Phe Leu Ala Met Ala Met Lys Met Ala Val 
65 70 75 80 

Glu Glu lie Asn Asn Gly Ser Ala Leu Leu Pro Gly Leu Arg Leu Gly 

85 90 95 

Tyr Asp Leu Phe Asp Thr Cys Ser Glu Pro Val Val Thr Met Lys Pro 

100 105 110 

Ser Leu Met Phe Met Ala Lys Val Gly Ser Gin Ser lie Ala Ala Tyr 
115 120 125 

Cys Asn Tyr Thr Gin Tyr Gin Pro Arg Val Leu Ala Val lie Gly Pro 
130 135 140 

His Ser Ser Glu Leu Ala Leu lie Thr Gly Lys Phe Phe Ser Phe Phe 
145 150 155 160 

Leu Met Pro Gin Val Ser Tyr Ser Ala Ser Met Asp Arg Leu Ser Asp 

165 170 175 

Arg Glu Thr Phe Pro Ser Phe Phe Arg Thr Val Pro Ser Asp Arg Val 

180 185 190 

Gin Leu Gin Ala Val Val Thr Leu Leu Gin Asn Phe Ser Trp Asn Trp 
195 200 205 

Val Ala Ala Leu Gly Ser Asp Asp Asp Tyr Gly Arg Glu Gly Leu Ser 
210 215 220 

lie Phe Ser Gly Leu Ala Asn Ser Arg Gly lie Cys lie Ala His Glu 
225 230 235 240 



Gly Leu Val Pro Gin His Asp Thr Ser Gly Gin Gin Leu Gly Lys Val 

245 250 255 



Val Asp Val Leu 

260 

Leu Phe Ala Ser 
275 

Leu His Asp Leu 
290 

Thr Ser Asp Leu 
305 

Val Leu Gly Phe 



Tyr Val Glu Thr 

34 0 

Ser Leu Lys Ala 
355 

Cys Ser Gin Cys 
370 

Met Gin Asn Leu 
385 

Tyr Ala Ala Val 



Cys Asn Val Ser 

420 

Leu Leu Glu Asn 
435 

Leu Gin Phe Asp 
450 

Met Trp Val Trp 
465 

Phe Asn Gly Thr 



Asn Gin Val Pro 

500 

Val Arg Arg Val 
515 

Cys Lys Ala Gly 
530 

Pro Cys Gly Lys 
545 




Arg Gin Val Asn 



Ala Arg Ala Val 

280 

Ser Pro Lys Val 
295 

Val Met Thr Leu 
310 

Leu Gin Arg Gly 
325 

Arg Leu Ala Leu 



Glu Leu Asp Leu 

360 

Asp Tyr lie Met 
375 

Ser Ala Gly Gin 
390 

Tyr Ser Val Ala 
405 

His Cys His Thr 



Met Tyr Asn Met 

440 

Ala Lys Gly Ser 
455 

Gin Ser Pro Thr 
470 

Leu Gin Leu Gin 
485 

Val Ser Gin Cys 



Lys Gly Phe His 

520 

Ser Tyr Arg Lys 
535 

Asp Gin Trp Ser 
550 
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Gin Ser Lys Val 
265 

Tyr Ser Leu Phe 



Trp Val Ala Ser 

300 

Pro Asn lie Ala 
315 

Ala Leu Leu Pro 
330 

Ala Ala Asp Pro 
345 

Glu Glu Arg Val 



Leu Gin Asn Leu 

380 

Leu His His Gin 
395 

Gin Ala Leu His 
410 

Ser Glu Pro Val 
425 

Ser Phe Arg Ala 



Val Asp Met Glu 

460 

Pro Val Leu His 
475 

His Ser Lys Met 
490 

Ser Arg Gin Cys 
505 

Ser Cys Cys Tyr 



His Pro Asp Asp 

540 

Pro Glu Lys Ser 
555 




Gin Val Val Val 
270 

Ser Tyr Ser lie 
285 

Glu Ser Trp Leu 



Arg Val Gly Thr 

320 

Glu Phe Ser His 
335 

Thr Phe Cys Ala 
350 

Met Gly Pro Arg 
365 

Ser Ser Gly Leu 



lie Phe Ala Thr 

400 

Asn Thr Leu Gin 
415 

Gin Pro Trp Gin 
430 

Arg Asp Leu Thr 
445 

Tyr Asp Leu Lys 



Thr Val Gly Thr 

480 

Tyr Trp Pro Gly 
495 

Lys Asp Gly Gin 
510 

Asp Cys Val Asp 
525 

Phe Thr Cys Thr 



Thr Thr Cys Leu 

560 
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Pro Arg Arg Pro Lys Phe Leu Ala Trp Gly Glu Pro Ala Val Leu Ser 

565 570 575 

Leu Leu Leu Leu Leu Cys Leu Val Leu Gly Leu Thr Leu Ala Ala Leu 

580 585 590 

Gly Leu Phe Val His Tyr Trp Asp Ser Pro Leu Val Gin Ala Ser Gly 
595 600 605 

Gly Ser Leu Phe Cys Phe Gly Leu lie Cys Leu Gly Leu Phe Cys Leu 
610 615 620 

Ser Val Leu Leu Phe Pro Gly Arg Pro Arg Ser Ala Ser Cys Leu Ala 
625 630 635 640 

Gin Gin Pro Met Ala His Leu Pro Leu Thr Gly Cys Leu Ser Thr Leu 

645 650 655 

Phe Leu Gin Ala Ala Glu lie Phe Val Glu Ser Glu Leu Pro Leu Ser 

660 665 670 

Trp Ala Asn Trp Leu Cys Ser Tyr Leu Arg Gly Pro Trp Ala Trp Leu 
675 680 685 

Val Val Leu Leu Ala Thr Leu Val Glu Ala Ala Leu Cys Ala Trp Tyr 
690 695 700 

Leu Met Ala Phe Pro Pro Glu Val Val Thr Asp Trp Gin Val Leu Pro 
705 710 715 720 

Thr Glu Val Leu Glu His Cys Arg Met Arg Ser Trp Val Ser Leu Gly 

725 730 735 

Leu Val His lie Thr Asn Ala Val Leu Ala Phe Leu Cys Phe Leu Gly 

740 745 750 

Thr Phe Leu Val Gin Ser Gin Pro Gly Arg Tyr Asn Arg Ala Arg Gly 
755 760 765 

Leu Thr Phe Ala Met Leu Ala Tyr Phe lie lie Trp Val Ser Phe Val 
770 775 780 

Pro Leu Leu Ala Asn Val Gin Val Ala Tyr Gin Pro Ala Val Gin Met 
785 790 795 800 

Gly Ala lie Leu Phe Cys Ala Leu Gly lie Leu Ala Thr Phe His Leu 

805 810 815 

Pro Lys Cys Tyr Val Leu Leu Trp Leu Pro Glu Leu Asn Thr Gin Glu 

820 825 830 

Phe Phe Leu Gly Arg Ser Pro Lys Glu Ala Ser Asp Gly Asn Ser Gly 
835 840 845 



Ser Ser Glu Ala Thr Arg Gly His Ser Glu 
850 855 
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<210> 15 
<211> 8194 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> modif ied_base 

<222> (1251) . . (1300) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modif ied_base 

<222> (1951) . . (2000) 

<223>a, t # c, g, other or unknown 



<400> 15 

gagaatctcg cgagatcccg tcggtccgcc ccgctgccct cccagctgcc gaaaagaggg 60 
gcctccgagc cgccggcgcc ctctgccggc aacctccgga agcacactag gaggttccag 120 
ccgatctggt cgaggggctc cacggaggac tccatttacg ttacgcaaat tccctacccc 180 
agccggccgg agagagaaag ccagaaacct cgcgaccagc catgggccac ctctccggaa 24 0 
aaacaccggg atattttttt tctcctgcag aaaaagcttt aggattggca gtttaaacaa 300 
aacatgtcta tttgcatacc ttcggtttgc atgcatttgt ttcgaagtga gcaaccctgg 360 
gtaacaaggc gaaagtatat gacaatttgc tcagaatctt aatgtcagaa aactggagac 420 
tggggcaggg gggtgtcgac tcaaagctgt gtctcattta gtaaactgag gcccaggtaa 480 
aaagttctga aacctcgcaa cacccggaga aattgtgttc cagcctccca cctcgcccca 540 
aaatgccaga gctccttttc taagccaggt gaagtcacag agcgtggaca gaacccacaa 600 
ccgtccagag gaagggtcac tgggtgccac ctggtttgca tctgtgcctt cgtcctgccc 660 
agttcctgag tgggaccgca ggcccggaat gtcaaggcaa acagtcctgc ttcagccact 720 
gggctccagt cccacccctt ttgggggcct gaagttagga agcatccggc agctgccttc 780 
tatttaagca actggcctcc ttagaggcca ctccttggcc atgccaggcg cgggcatctg 840 
gccagcatgc tgctctgcac ggctcgcctg gtcggcctgc agcttctcat ttcctgctgc 900 
tgggcctttg cctgccatag cacggagtct tctcctgact tcaccctccc cggagattac 960 
ctcctggcag gcctgttccc tctccattct ggctgtctgc aggtgaggca cagacccgag 102 0 
gtgaccctgt gtgacaggtg agtgaggggc cagcagagcc acacttagtg ggacccctgg 108 0 
ctatagggcc cctctggctg ccatcctcca aacaggacct tgcctctgcc tttgcccctt 1140 
gaactgtccc caggccttgt tcatcaatcc acttgccacc taagtgctgg ctagaccttc 1200 
ctagacactt cggccagttt ccaattattt cacccttgct gttagaatgt nnnnnnnnnn 1260 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn aattccttaa actaaatttc 1320 
tcactttctc tctctctctg gaaaacactg actaatgtag caggtttctc tgctccagga 1380 
cttcaggacc ttttcgatgc taataagttt ctccatcagg gccagcttgt tcctcctact 1440 
gagcttgaga gcccttgttg aagttgtggt ttgggggact ggaccgatga cctcaaaggt 1500 
tccctttgct cccaagcctc agagtctagg aggccagagg gtctcagcag gcctttgtcc 1560 
ttctcagctg tctcttactg gctttctcca caggtcttgt agcttcaatg agcatggcta 1620 
ccacctcttc caggctatgc ggcttggggt tgaggagata aacaactcca cggccctgct 1680 
gcccaacatc accctggggt accagctgta tgatgtgtgt tctgactctg ccaatgtgta 174 0 
tgccacgctg agagtgctct ccctgccagg gcaacaccac atagagctcc aaggagacct 1800 
tctccactat tcccctacgg tgctggcagt gattgggcct gacagcacca accgtgctgc 1860 
caccacagcc gccctgctga gccctttcct ggtgcccatg gtaagctgga gcctcagacc 1920 
tttgcccatc tcccttcagg caagtctggg nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 198 0 
nnnnnnnnnn nnnnnnnnnn gccaccatgc ccggctaatt tttttgtatt tttagtagag 2 04 0 
acggggtttc accgtgttag ccaggctggt cgcaaactcc taacctcgtg atccacccac 2100 
ctcggcctcc caatgtgctg ggattacagg tgtgagccac tgcacccggc cataatgtat 2160 
taatataata aaataattat acaactcacc ataatgtaga atcagtggga gccctgagct 2220 
tgttttccta caactagatg gtcccatctg ggggtgatgg gagacagtga cagatcatca 2280 
gacattagat tctcataagt agcgtgcaac ccagatccct cgcatgtgca gttcacagta 2340 
gggttcaagc tcctacaaga atctgatgct gctgctgatc tgacaggagg ggagcagctg 24 00 
taaatacaga tgaagcttcg cttactcacc agctgctcac ctcctcctgt gaggcccggt 2460 
tcctaacagg ccactgacct aacttctgcc ctgacctaca catgcttctc ttcttccttg 2520 
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caaactgcct ccagtggaag tccctgaagg tccccaaaca cacgggacta tttcactcct 2580 
atgcaggttt tgtctccttt gcttggaatg catcccctca ccccttgtcc ccaggcagat 2640 
tcccacccct cccccagaac ctgccccagt ggagccttcg caggtgattt gtcagtttca 2700 
caggctgagg ggtgctctcc tggtctcccc ggctccctgt atccccacac ccagcacagg 2760 
gccaggcact gggggggcct tcagtggaga ctgaaatggc tgaacgggac ctcccataga 2820 
ttagctatgc ggccagcagc gagacgctca gcgtgaagcg gcagtatccc tctttcctgc 2880 
gcaccatccc caatgacaag taccaggtgg agaccatggt gctgctgctg cagaagttcg 2940 
ggtggacctg gatctctctg gttggcagca gtgacgacta tgggcagcta ggggtgcagg 3000 
cactggagaa ccaggccact ggtcagggga tctgcattgc tttcaaggac atcatgccct 3060 
tctctgccca ggtgggcgat gagaggatgc agtgcctcat gcgccacctg gcccaggccg 3120 
gggccaccgt cgtggttgtt ttttccagcc ggcagttggc cagggtgttt ttcgagtccg 318 0 
tggtgctgac caacctgact ggcaaggtgt gggtcgcctc agaagcctgg gccctctcca 3240 
ggcacatcac tggggtgccc gggatccagc gcattgggat ggtgctgggc gtggccatcc 3300 
agaagagggc tgtccctggc ctgaaggcgt ttgaagaagc ctatgcccgg gcagacaaga 33 60 
aggcccctag gccttgccac aagggctcct ggtgcagcag caatcagctc tgcagagaat 3420 
gccaagcttt catggcacac acgatgccca agctcaaagc cttctccatg agttctgcct 3480 
acaacgcata ccgggctgtg tatgcggtgg cccatggcct ccaccagctc ctgggctgtg 3540 
cctctggagc ttgttccagg ggccgagtct acccctggca ggtaagagag cccaccccag 3600 
cacctcctgt cagggagaac agccaatcct gagatgagca gagtgggcac tctccggtca 3660 
ctctaaatgc caagggggat aaatgccact aacttgaggt tttttgtttt gttttgtttt 3720 
gttttttgag acagtctggc tctgtcaccc aggctgcagt gtagtgatgc gatctcggct 3780 
ctctgcaact tccacctcct gggttcaagt gattctcttg cctcggcctc ctgagtagct 3840 
gggattacag gcacccacca ccatgcctgg ataatttttc ttttcttttt tttttttttg 3900 
agatagagtc tcgctctgtt gcccaggctg gaatgcagtg gtgcgatctt ggctcactgt 3960 
gagctccgcc tcccaggttc actccattcc cctgcctcag cctcccaagt aggtgggact 4020 
acgggcgccc gccaccacgc ccagctaatt ttttttgtat tttgagtaga gacggggttt 4080 
caccatgtta gccaggatgg tctcaatctc ctgaccttgt catccgccca cctcgtcctc 4140 
ccaaagtgct gggattacag gcgtgagcca ccgcacccgg cctaattttt gtatttttag 4200 
tagagatggg gtttcaccat gttggccagg ctggtctcga actcctggca tcaagtgatc 4260 
ctcctgcttc ggcctcccaa agtgctggga ttacaggcat tagctctctt ctcttagaca 4320 
gatctttctc tctgatcctt gccttctctc acccactgtg tcttggaagt gtcaagtgat 4380 
aagatccagg gctaaaactg tctgtaaagg agtgtttgtt agaggcctcc tctcaggagg 4440 
ttggtgggga agattgaggg gcttcctaag aaggaaggga cgagaccttc ctgatgggct 4 500 
gaaaccacca ggacggaaac ccaggaaggc cccaggccct tgcttctggg accatgtggg 4560 
tctgtgctgt ctgtggtggc ttcatgatac gcgtttcttt cagcttttgg agcagatcca 4620 
caaggtgcat ttccttctac acaaggacac tgtggcgttt aatgacaaca gagatcccct 4680 
cagtagctat aacataattg cctgggactg gaatggaccc aagtggacct tcacggtcct 4740 
cggttcctcc acatggtctc cagttcagct aaacataaat gagaccaaaa tccagtggca 4800 
cggaaaggac aaccaggtaa tggggatgtg gctactcacc atgtaactgg cttatgggca 4 860 
acctagagcc tgggggtgat gctgacacag tgtacaggga gcaggagggg ggccccaggg 4 920 
gtccagctgc caccactcta cccatcctgg ccagggaagc agggaagaca ctccgtaggc 4 980 
gagtgtgcag atgccctggg gcggaagttc acacgaccag gggccctgcc ctgggagtga 5040 
gccctgaggg cagatgcaca gagattctgt tttctgttcc acatgtgagc tgtcctttga 5100 
cttgggcccc tacgtgtggc ccctctggct tcttacaggt gcctaagtct gtgtgttcca 5160 
gcgactgtct tgaagggcac cagcgagtgg ttacgggttt ccatcactgc tgctttgagt 5220 
gtgtgccctg tggggctggg accttcctca acaagagtgg tgagtgggca atggagcagg 5280 
cgagctaccc agcactcccg ggggctgcac ggtggaggga gggcctccct tgggccccat 5340 
gtgccctgcc ccagaaccaa ggcccagtca ctgggctgcc agttagcttc aggttggagg 5400 
acacctgcta ccagacagaa ttctgatcaa gagaatcagc cactgggtgc ggtggctcat 5460 
gcctgtaatc ccagcacttt gggaggctga ggcgggtgga tcacttgagg tcgggagttc 5520 
gagaccagcc tggccaacat ggtgaaaccc catctctacc aaaaatataa aaaattagct 558 0 
gggtgtggtg gcgcgtgcct gtaatcccag ctactcggga ggctgaggca ggagaatcac 5640 
ttgaacccag gaggcggagg ttgcagtgag ccaagatgca ttccagcctg gaccacaaag 5700 
cgagaattcg tccccccaaa aaaagaaagg aggccgggcg cggtggctca cacctgtaat 5760 
cccagcactt tgggaggccg aggtgggtgg atcacctgag gtcaggagtt cgagaccagc 5820 
ctgaccaaca tggtgaaacc ccatctctac taaaaataca aaaaaagtta gccgggcgtt 5880 
gtggcgtgtg cctgtaattc cagctactcg ggaggctgag gcaggagaat tgcttgaacc 5940 
cgggaggcgg aggttgcagt gagccaagat tgcaccattg cactccagcc tgggcgacaa 6000 
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gagaaaaact ctgtctcaaa aaaaaagaaa 
agactgaggt gtgtcctctg ttagagagct 
tcctgaattc aacctctttc tctaaatgaa 
tcctactgtc ccttctgttg cccatgccaa 
gaatacagat actagcgtgt ctgcagcaga 
caaggtcaag aacagagagc aaattaggta 
aggggccaca ttctttcctt tcaccatctc 
tcagagggtc aggactcaga gaaaccacag 
atgctggcaa gagagagggg ttacaatgcc 
tcgtaacctc tctggtatgt gggcagagct 
aaaccctcat gggatcatga gccatacaga 
taacacaggc attttgagga aatgcgtggc 
cactatgcct agtatagtct agctgccctg 
tatttcctat tcctactctg ctcatctggc 
gacctctaca gatgccagcc ttgtgggaaa 
tgcttcccgc gcactgtggt gtttttggct 
gcagctaaca cgctgctgct gctgctgctg 
ctagacaccc ctgtggtgag gtcagcaggg 
ctggcagcag gtagtggcag cctctatggc 
ttgctacgcc aggccctctt tgcccttggt 
cgctcattcc aactaatcat catcttcaag 
gcctgggtcc aaaaccacgg tgctggcctg 
cttatctgtc taacttggct ggtggtgtgg 
ttcccccatc tggtgatgct tgagtgcaca 
ttcctctaca atggcctcct ctccatcagt 
ttgccagaga actacaacga ggccaaatgt 
tcctggatcg ccttcttcac cacggccagc 
aacatgatgg ctgggctgag cagcctgagc 
tgctacgtga tcctctgccg cccagacctc 
caggactaca cgaggcgctg cggctccacc 
gccttctctg ccctgagggt cgaaggtcga 
gcatcgcggt ctggggttgg gacgtgtaag 
ctgccaataa agaagtgaaa tgcgtatctg 
acggattcaa gtctgaaccc agagcctgga 
ggcaactccg gaggcgggcc cagcgccaaa 
cattggccgt tctctgcggg gccccgccct 
cgaggcgcac ctcctggcct gcacgctttg 



gaaagaaaga attagccaac tgaaagcctt 6060 
gtcatcacaa ctcctacaaa agcagtcgta 6120 
tatagctatt gttccctttg tgccctcttg 6180 
agacagctag ctccttgaac agcttggcct 6240 
gaaaaaaaca gcattcccca tccagaaatg 63 00 
gctaaggact caggtcctta gttggtgtcc 6360 
tgtagggaca ggaatacttc ccttctgtcc 6420 
agcagcagct caggaaagtg gttcatggaa 64 80 
ctcccttggg agcaggctgc tcccatcaga 6540 
accaggttaa ggtcctccct agggtttgca 6600 
accgacctgt gtgtctccag agtctgtaat 6660 
ctcaggcccc actcccggct acccccatcc 6720 
gtacaattct cccagtatct tgcaggcccc 6780 
tctcaggaac cttcttggcc ttccctttca 6840 
gaagagtggg cacctgaggg aagccagacc 6900 
ttgcgtgagc acacctcttg ggtgctgctg 6960 
cttgggactg ctggcctgtt tgcctggcac 702 0 
ggccgcctgt gctttcttat gctgggctcc 7080 
ttctttgggg aacccacaag gcctgcgtgc 7140 
ttcaccatct tcctgtcctg cctgacagtt 7200 
ttttccacca aggtacctac attctaccac 7260 
tttgtgatga tcagctcagc ggcccagctg 7320 
accccactgc ctgctaggga ataccagcgc 7380 
gagaccaact ccctgggctt catactggcc 7440 
gcctttgcct gcagctacct gggtaaggac 7500 
gtcaccttca gcctgctctt caacttcgtg 7560 
gtctacgacg gcaagtacct gcctgcggcc 7620 
agcggcttcg gtgggtattt tctgcctaag 7680 
aacagcacag agcacttcca ggcctccatt 7740 
tgaccagtgg gtcagcaggc acggctggca 7800 
gcaggccggg ggtgtccggg aggtctttgg 7860 
cgcctgggag agcctagacc aggctccggg 7920 
gtctcctgtc gtgggagagt gtgaggtgta 7980 
aaaggctgac cgcccagatt gacgttgcta 8 04 0 
agaacagggc gaggcgtcgt ccccgcatcc 8100 
cgggggccgg agctagaagc tctacgcttc 8160 
acgt 8194 



<210> 16 

<211> 2526 

<212> DNA 

<213> Homo sapiens 

<400> 16 

atgctgctct gcacggctcg cctggtcggc 
tttgcctgcc atagcacgga gtcttctcct 
gcaggcctgt tccctctcca ttctggctgt 
ctgtgtgaca ggtcttgtag cttcaatgag 
cttggggttg aggagataaa caactccacg 
cagctgtatg atgtgtgttc tgactctgcc 
ctgccagggc aacaccacat agagctccaa 
ctggcagtga ttgggcctga cagcaccaac 
cctttcctgg tgcccatgat tagctatgcg 
cagtatccct ctttcctgcg caccatcccc 
ctgctgctgc agaagttcgg gtggacctgg 
gggcagctag gggtgcaggc actggagaac 
ttcaaggaca tcatgccctt ctctgcccag 



ctgcagcttc tcatttcctg ctgctgggcc 60 
gacttcaccc tccccggaga ttacctcctg 120 
ctgcaggtga ggcacagacc cgaggtgacc 180 
catggctacc acctcttcca ggctatgcgg 240 
gccctgctgc ccaacatcac cctggggtac 300 
aatgtgtatg ccacgctgag agtgctctcc 3 60 
ggagaccttc tccactattc ccctacggtg 420 
cgtgctgcca ccacagccgc cctgctgagc 480 
gccagcagcg agacgctcag cgtgaagcgg 54 0 
aatgacaagt accaggtgga gaccatggtg 60 0 
atctctctgg ttggcagcag tgacgactat 660 
caggccactg gtcaggggat ctgcattgct 720 
gtgggcgatg agaggatgca gtgcctcatg 780 




17 

cgccacctgg cccaggccgg ggccaccgtc gtggttgttt tttccagccg gcagttggcc 840 
agggtgtttt tcgagtccgt ggtgctgacc aacctgactg gcaaggtgtg ggtcgcctca 900 
gaagcctggg ccctctccag gcacatcact ggggtgcccg ggatccagcg cattgggatg 96 0 
gtgctgggcg tggccatcca gaagagggct gtccctggcc tgaaggcgtt tgaagaagcc 1020 
tatgcccggg cagacaagaa ggcccctagg ccttgccaca agggctcctg gtgcagcagc 1080 
aatcagctct gcagagaatg ccaagctttc atggcacaca cgatgcccaa gctcaaagcc 1140 
ttctccatga gttctgccta caacgcatac cgggctgtgt atgcggtggc ccatggcctc 12 00 
caccagctcc tgggctgtgc ctctggagct tgttccaggg gccgagtcta cccctggcag 1260 
cttttggagc agatccacaa ggtgcatttc cttctacaca aggacactgt ggcgtttaat 1320 
gacaacagag atcccctcag tagctataac ataattgcct gggactggaa tggacccaag 13 80 
tggaccttca cggtcctcgg ttcctccaca tggtctccag ttcagctaaa cataaatgag 1440 
accaaaatcc agtggcacgg aaaggacaac caggtgccta agtctgtgtg ttccagcgac 1500 
tgtcttgaag ggcaccagcg agtggttacg ggtttccatc actgctgctt tgagtgtgtg 1560 
ccctgtgggg ctgggacctt cctcaacaag agtgacctct acagatgcca gccttgtggg 1620 
aaagaagagt gggcacctga gggaagccag acctgcttcc cgcgcactgt ggtgtttttg 1680 
gctttgcgtg agcacacctc ttgggtgctg ctggcagcta acacgctgct gctgctgctg 1740 
ctgcttggga ctgctggcct gtttgcctgg cacctagaca cccctgtggt gaggtcagca 1800 
gggggccgcc tgtgctttct tatgctgggc tccctggcag caggtagtgg cagcctctat 1860 
ggcttctttg gggaacccac aaggcctgcg tgcttgctac gccaggccct ctttgccctt 1920 
ggtttcacca tcttcctgtc ctgcctgaca gttcgctcat tccaactaat catcatcttc 1980 
aagttttcca ccaaggtacc tacattctac cacgcctggg tccaaaacca cggtgctggc 2040 
ctgtttgtga tgatcagctc agcggcccag ctgcttatct gtctaacttg gctggtggtg 2100 
tggaccccac tgcctgctag ggaataccag cgcttccccc atctggtgat gcttgagtgc 2160 
acagagacca actccctggg cttcatactg gccttcctct acaatggcct cctctccatc 2220 
agtgcctttg cctgcagcta cctgggtaag gacttgccag agaactacaa cgaggccaaa 2280 
tgtgtcacct tcagcctgct cttcaacttc gtgtcctgga tcgccttctt caccacggcc 2340 
agcgtctacg acggcaagta cctgcctgcg gccaacatga tggctgggct gagcagcctg 2400 
agcagcggct tcggtgggta ttttctgcct aagtgctacg tgatcctctg ccgcccagac 2460 
ctcaacagca cagagcactt ccaggcctcc attcaggact acacgaggcg ctgcggctcc 2 520 
acctga 2526 



<210> 17 
<211> 841 
<212> PRT 

<213> Homo sapiens 



<400> 17 
Met Leu Leu Cys 
1 

Cys Cys Trp Ala 

20 

Thr Leu Pro Gly 
35 



Thr Ala Arg Leu 
5 

Phe Ala Cys His 

Asp Tyr Leu Leu 

40 



Val Gly Leu Gin 
10 

Ser Thr Glu Ser 
25 

Ala Gly Leu Phe 



Leu Leu lie Ser 
15 

Ser Pro Asp Phe 
30 

Pro Leu His Ser 
45 



Gly Cys Leu Gin Val Arg His Arg Pro Glu Val Thr Leu Cys Asp Arg 
50 55 60 

Ser Cys Ser Phe Asn Glu His Gly Tyr His Leu Phe Gin Ala Met Arg 
65 70 75 80 

Leu Gly Val Glu Glu lie Asn Asn Ser Thr Ala Leu Leu Pro Asn lie 

85 90 95 



Thr Leu Gly Tyr Gin Leu Tyr Asp Val Cys Ser Asp Ser Ala Asn Val 

100 105 110 



Tyr Ala Thr Leu 
115 

Leu Gin Gly Asp 
130 

Gly Pro Asp Ser 
145 

Pro Phe Leu Val 



Ser Val Lys Arg 

180 

Lys Tyr Gin Val 
195 

Thr Trp lie Ser 
210 

Val Gin Ala Leu 
225 

Phe Lys Asp lie 



Gin Cys Leu Met 

260 

Val Phe Ser Ser 
275 

Leu Thr Asn Leu 
290 

Leu Ser Arg His 
305 

Val Leu Gly Val 



Phe Glu Glu Ala 

340 

His Lys Gly Ser 
355 

Ala Phe Met Ala 
370 

Ser Ala Tyr Asn 
385 

His Gin Leu Leu 




Arg Val Leu Ser 

120 

Leu Leu His Tyr 
135 

Thr Asn Arg Ala 
150 

Pro Met lie Ser 
165 

Gin Tyr Pro Ser 



Glu Thr Met Val 

200 

Leu Val Gly Ser 
215 

Glu Asn Gin Ala 
230 

Met Pro Phe Ser 
245 

Arg His Leu Ala 



Arg Gin Leu Ala 

280 

Thr Gly Lys Val 
295 

He Thr Gly Val 
310 

Ala He Gin Lys 
325 

Tyr Ala Arg Ala 



Trp Cys Ser Ser 

360 

His Thr Met Pro 
375 

Ala Tyr Arg Ala 
390 

Gly Cys Ala Ser 
405 



18 



Leu Pro Gly Gin 



Ser Pro Thr Val 

140 

Ala Thr Thr Ala 
155 

Tyr Ala Ala Ser 
170 

Phe Leu Arg Thr 
185 

Leu Leu Leu Gin 



Ser Asp Asp Tyr 

220 

Thr Gly Gin Gly 
235 

Ala Gin Val Gly 
250 

Gin Ala Gly Ala 
265 

Arg Val Phe Phe 



Trp Val Ala Ser 

300 

Pro Gly He Gin 
315 

Arg Ala Val Pro 
330 

Asp Lys Lys Ala 
345 

Asn Gin Leu Cys 



Lys Leu Lys Ala 

380 

Val Tyr Ala Val 
395 

Gly Ala Cys Ser 
410 




His His He Glu 
125 

Leu Ala Val He 



Ala Leu Leu Ser 

160 

Ser Glu Thr Leu 
175 

He Pro Asn Asp 
190 

Lys Phe Gly Trp 
205 

Gly Gin Leu Gly 



He Cys He Ala 

240 

Asp Glu Arg Met 
255 

Thr Val Val Val 
270 

Glu Ser Val Val 
285 

Glu Ala Trp Ala 



Arg He Gly Met 

320 

Gly Leu Lys Ala 
335 

Pro Arg Pro Cys 
350 

Arg Glu Cys Gin 
365 

Phe Ser Met Ser 



Ala His Gly Leu 

400 

Arg Gly Arg Val 
415 



Tyr Pro Trp Gin 

420 

His Lys Asp Thr 
435 

Tyr Asn lie lie 
450 

Val Leu Gly Ser 
465 

Thr Lys lie Gin 



Cys Ser Ser Asp 

500 

His His Cys Cys 
515 

Asn Lys Ser Asp 
530 

Ala Pro Glu Gly 
54 5 

Ala Leu Arg Glu 



Leu Leu Leu Leu 

580 

Asp Thr Pro Val 
595 

Leu Gly Ser Leu 
610 

Glu Pro Thr Arg 
625 

Gly Phe Thr lie 



He He He Phe 

660 

Trp Val Gin Asn 
675 

Ala Gin Leu Leu 
690 

Pro Ala Arg Glu 
705 




Leu Leu Glu Gin 



Val Ala Phe Asn 

440 

Ala Trp Asp Trp 
455 

Ser Thr Trp Ser 
470 

Trp His Gly Lys 
485 

Cys Leu Glu Gly 



Phe Glu Cys Val 

520 

Leu Tyr Arg Cys 
535 

Ser Gin Thr Cys 
550 

His Thr Ser Trp 
565 

Leu Leu Gly Thr 



Val Arg Ser Ala 

600 

Ala Ala Gly Ser 
615 

Pro Ala Cys Leu 
630 

Phe Leu Ser Cys 
645 

Lys Phe Ser Thr 



His Gly Ala Gly 

680 

He Cys Leu Thr 
695 

Tyr Gin Arg Phe 
710 
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He His Lys Val 
425 

Asp Asn Arg Asp 

Asn Gly Pro Lys 

460 

Pro Val Gin Leu 
475 

Asp Asn Gin Val 
490 

His Gin Arg Val 
505 

Pro Cys Gly Ala 



Gin Pro Cys Gly 

540 

Phe Pro Arg Thr 
555 

Val Leu Leu Ala 
570 

Ala Gly Leu Phe 
585 

Gly Gly Arg Leu 



Gly Ser Leu Tyr 

620 

Leu Arg Gin Ala 
635 

Leu Thr Val Arg 
650 

Lys Val Pro Thr 
665 

Leu Phe Val Met 



Trp Leu Val Val 

700 

Pro His Leu Val 
715 




His Phe Leu Leu 
430 

Pro Leu Ser Ser 
445 

Trp Thr Phe Thr 



Asn He Asn Glu 

480 

Pro Lys Ser Val 
495 

Val Thr Gly Phe 
510 

Gly Thr Phe Leu 
525 

Lys Glu Glu Trp 



Val Val Phe Leu 

560 

Ala Asn Thr Leu 
575 

Ala Trp His Leu 
590 

Cys Phe Leu Met 
605 

Gly Phe Phe Gly 



Leu Phe Ala Leu 

640 

Ser Phe Gin Leu 
655 

Phe Tyr His Ala 
670 

He Ser Ser Ala 
685 

Trp Thr Pro Leu 



Met Leu Glu Cys 

720 
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Thr Glu Thr Asn 



Leu Leu Ser lie 

74 0 

Pro Glu Asn Tyr 
755 

Asn Phe Val Ser 
770 

Gly Lys Tyr Leu 
785 

Ser Ser Gly Phe 



Cys Arg Pro Asp 

820 

Asp Tyr Thr Arg 
835 



Ser Leu Gly Phe 
725 

Ser Ala Phe Ala 



Asn Glu Ala Lys 

760 

Trp lie Ala Phe 
775 

Pro Ala Ala Asn 
790 

Gly Gly Tyr Phe 
805 

Leu Asn Ser Thr 



Arg Cys Gly Ser 

840 



lie Leu Ala Phe 
730 

Cys Ser Tyr Leu 
745 

Cys Val Thr Phe 



Phe Thr Thr Ala 

780 

Met Met Ala Gly 
795 

Leu Pro Lys Cys 
810 

Glu His Phe Gin 
825 

Thr 



Leu Tyr Asn Gly 
735 

Gly Lys Asp Leu 
750 

Ser Leu Leu Phe 
765 

Ser Val Tyr Asp 



Leu Ser Ser Leu 

800 

Tyr Val lie Leu 
815 

Ala Ser lie Gin 
830 



<210> 18 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> M0D_RES 

<222> (1) 

<223> Thr or Arg 

<220> 

<221> MOD_RES 

<222> (3) 

<223> Phe or Leu 

<220> 

<221> M0D_RES 
<222> (4) 

<223> Arg, Gin or Pro 
<220> 

<221> MOD_RES 

<222> (6) 

<22 3> Arg or Thr 

<220> 

<221> MOD__RES 
<222> (7) 

<223> Ser, Pro or Val 
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<220> 

<221> MOD_RES 
<222> (8) 

<223> Val, Glu, Arg, Lys or Thr 
<220> 

<221> MOD_RES 

<222> (11) 

<223> Ala or Glu 

<220> 

<221> MOD_RES 

<222> (12) 

<223> Trp or Leu 

<220> 

<221> MODJRES 
<222> (13) 

<223> Arg, His or Gly 
<400> 18 

Xaa Cys Xaa Xaa Arg Xaa Xaa Xaa Phe Leu Xaa Xaa Xaa Glu 
15 10 



<210> 19 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 

<222> (1) 

<223> Leu or Gin 

<220> 

<221> MOD_RES 
<222> (3) 

<223> Glu, Gly or Thr 
<220> 

<221> M0D_RES 
<222> (4) 

<223> Asn, Arg or Cys 
<220> 

<221> MOD_RES 

<222> (7) 

<223> Arg or Glu 

<220> 

<221> MOD_RES 
<222> (9) 
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<223> Arg or Lys 
<220> 

<221> MOD_RES 
<222> (10) 

<223> Cys, Gly or Phe 
<220> 

<221> MOD_RES 
<222> (11) 

<223> Val, Leu or He 
<220> 

<221> MOD_RES 

<222> (13) 

<223> Phe or Leu 

<220> 

<221> MOD_RES 

<222> (14) 

<223> Ala or Ser 

<220> 

<221> MOD_RES 

<222> (15) 

<223> Met or Leu 

<400> 19 

Xaa Pro Xaa Xaa Tyr Asn Xaa Ala Xaa Xaa Xaa Thr Xaa Xaa Xaa 
15 10 15 



<210> 20 
<211> 3563 
<212> DNA 

<213> Homo sapiens 
<400> 20 

agcctggcag tggcctcagg cagagtctga cgcgcacaaa ctttcaggcc caggaagcga 6 0 
ggacaccact ggggccccag ggtgtggcaa gtgaggatgg caagggtttt gctaaacaaa 120 
tcctctgccc gctccccgcc ccgggctcac tccatgtgag gccccagtcg gggcagccac 180 
ctgccgtgcc tgttggaagt tgcctctgcc atgctgggcc ctgctgtcct gggcctcagc 24 0 
ctctgggctc tcctgcaccc tgggacgggg gccccattgt gcctgtcaca gcaacttagg 3 00 
atgaaggggg actacgtgct gggggggctg ttccccctgg gcgaggccga ggaggctggc 3 60 
ctccgcagcc ggacacggcc cagcagccct gtgtgcacca ggtacagagg tgggacggcc 420 
tgggtcgggg tcagggtgac caggtctggg gtgctcctga gctggggccg aggtggccat 4 80 
ctgcggttct gtgtggcccc aggttctcct caaacggcct gctctgggca ctggccatga 540 
aaatggccgt ggaggagatc aacaacaagt cggatctgct gcccgggctg cgcctgggct 600 
acgacctctt tgatacgtgc tcggagcctg tggtggccat gaagcccagc ctcatgttcc 660 
tggccaaggc aggcagccgc gacatcgccg cctactgcaa ctacacgcag taccagcccc 720 
gtgtgctggc tgtcatcggg ccccactcgt cagagctcgc catggtcacc ggcaagttct 780 
tcagcttctt cctcatgccc cagtggggcg ccccccacca tcacccaccc ccaaccaacc 840 
cctgccccgt gggagcccct tgtgtcagga gaatgctaca tgcaccccac ccagccctgc 900 
cctgggagcc ctgtgtcaga agatgctctt ggccttgcag gtcagctacg gtgctagcat 960 
ggagctgctg agcgcccggg agaccttccc ctccttcttc cgcaccgtgc ccagcgaccg 1020 
tgtgcagctg acggccgccg cggagctgct gcaggagttc ggctggaact gggtggccgc 1080 
cctgggcagc gacgacgagt acggccggca gggcctgagc atcttctcgg ccctggccgc 114 0 
ggcacgcggc atctgcatcg cgcacgaggg cctggtgccg ctgccccgtg ccgatgactc 1200 
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gcggctgggg aaggtgcagg acgtcctgca ccaggtgaac cagagcagcg tgcaggtggt 1260 
gctgctgttc gcctccgtgc acgccgccca cgccctcttc aactacagca tcagcagcag 1320 
gctctcgccc aaggtgtggg tggccagcga ggcctggctg acctctgacc tggtcatggg 1380 
gctgcccggc atggcccaga tgggcacggt gcttggcttc ctccagaggg gtgcccagct 1440 
gcacgagttc ccccagtacg tgaagacgca cctggccctg gccaccgacc cggccttctg 1500 
ctctgccctg ggcgagaggg agcagggtct ggaggaggac gtggtgggcc agcgctgccc 1560 
gcagtgtgac tgcatcacgc tgcagaacgt gagcgcaggg ctaaatcacc accagacgtt 162 0 
ctctgtctac gcagctgtgt atagcgtggc ccaggccctg cacaacactc ttcagtgcaa 1680 
cgcctcaggc tgccccgcgc aggaccccgt gaagccctgg caggtgagcc cgggagatgg 1740 
gggtgtgctg tcctctgcat gtgcccaggc caccaggcac ggccaccacg cctgagctgg 1800 
aggtggctgg cggctcagcc ccgtcccccg cccgcagctc ctggagaaca tgtacaacct 1860 
gaccttccac gtgggcgggc tgccgctgcg gttcgacagc agcggaaacg tggacatgga 1920 
gtacgacctg aagctgtggg tgtggcaggg ctcagtgccc aggctccacg acgtgggcag 1980 
gttcaacggc agcctcagga cagagcgcct gaagatccgc tggcacacgt ctgacaacca 2040 
ggtgaggtga gggtgggtgt gccaggcgtg cccgtggtag cccccgcggc agggcgcagc 2100 
ctggSSBt-Sg gggccgttcc agtctcccgt gggcatgccc agccgagcag agccagaccc 216 0 
caggcctgtg cgcagaagcc cgtgtcccgg tgctcgcggc agtgccagga gggccaggtg 2220 
cgccgggtca aggggttcca ctcctgctgc tacgactgtg tggactgcga ggcgggcagc 22 80 
taccggcaaa acccaggtga gccgccttcc cggcaggcgg gggtgggaac gcagcagggg 234 0 
agggtcctgc caagtcctga ctctgagacc agagcccaca gggtacaaga cgaacaccca 2400 
gcgcccttct cctctctcac agacgacatc gcctgcacct tttgtggcca ggatgagtgg 2460 
tccccggagc gaagcacacg ctgcttccgc cgcaggtctc ggttcctggc atggggcgag 2520 
ccggctgtgc tgctgctgct cctgctgctg agcctggcgc tgggccttgt gctggctgct 2580 
ttggggctgt tcgttcacca tcgggacagc ccactggttc aggcctcggg ggggcccctg 2640 
gcctgctttg gcctggtgtg cctgggcctg gtctgcctca gcgtcctcct gttccctggc 2700 
cagcccagcc ctgcccgatg cctggcccag cagcccttgt cccacctccc gctcacgggc 2760 
tgcctgagca cactcttcct gcaggcggcc gagatcttcg tggagtcaga actgcctctg 2820 
agctgggcag accggctgag tggctgcctg cgggggccct gggcctggct ggtggtgctg 288 0 
ctggccatgc tggtggaggt cgcactgtgc acctggtacc tggtggcctt cccgccggag 294 0 
gtggtgacgg actggcacat gctgcccacg gaggcgctgg tgcactgccg cacacgctcc 3000 
tgggtcagct tcggcctagc gcacgccacc aatgccacgc tggcctttct ctgcttcctg 3060 
ggcactttcc tggtgcggag ccagccgggc tgctacaacc gtgcccgtgg cctcaccttt 3120 
gccatgctgg cctacttcat cacctgggtc tcctttgtgc ccctcctggc caatgtgcag 3180 
gtggtcctca ggcccgccgt gcagatgggc gccctcctgc tctgtgtcct gggcatcctg 324 0 
gctgccttcc acctgcccag gtgttacctg ctcatgcggc agccagggct caacaccccc 3300 
gagttcttcc tgggaggggg ccctggggat gcccaaggcc agaatgacgg gaacacagga 3360 
aatcagggga aacatgagtg acccaaccct gtgatctcag ccccggtgaa cccagactta 342 0 
gctgcgatcc cccccaagcc agcaatgacc cgtgtctcgc tacagagacc ctcccgctct 3480 
aggttctgac cccaggttgt ctcctgaccc tgaccccaca gtgagcccta ggcctggagc 354 0 
acgtggacac ccctgtgacc ate 3563 



<210> 21 
<211> 839 
<212> PRT 

<213> Homo sapiens 
<400> 21 

Met Gly Pro Arg Ala Lys Thr He Cys Ser Leu Phe Phe Leu Leu Trp 
15 10 15 

Val Leu Ala Glu Pro Ala Glu Asn Ser Asp Phe Tyr Leu Pro Gly Asp 

20 25 30 

Tyr Leu Leu Gly Gly Leu Phe Ser Leu His Ala Asn Met Lys Gly He 
35 40 45 



Val His Leu Asn 
50 

Lys Val lie Gly 
65 

Glu lie Asn Asn 



Glu He Val Asp 

100 

Tyr Phe Leu Ala 
115 

Ser Asn Tyr He 
130 

Glu Ser Val Met 
145 

Gin He Thr Tyr 



Phe Pro Ala Leu 

180 

Ala Met Val Gin 
195 

Leu Val Ser Ser 
210 

Glu Arg Val Ala 
225 

Pro Thr Leu Gin 



Leu Val Thr He 

260 

Val Val Phe Ser 
275 

Leu Arg Gin Asn 
290 

Ala He Asp Pro 
305 

Thr Phe Leu Gly 



Phe Leu Gin Val 
55 

Tyr Asn Leu Met 
70 

Asp Ser Ser Leu 
85 

Val Cys Tyr He 



His Glu Asp Asn 

120 

Ser Arg Val Val 
135 

Thr Val Ala Asn 
150 

Ser Ala He Ser 
165 

Leu Arg Thr Thr 



Leu Met Leu His 

200 

Asp Thr Tyr Gly 
215 

Arg Arg Asp He 
230 

Pro Asn Gin Asn 
245 

Val Asp Lys Leu 



Pro Asp Leu Thr 

280 

Phe Thr Gly Ala 
295 

Val Leu His Asn 
310 

He Thr He Gin 
325 



24 

Pro Met Cys Lys 

60 

Gin Ala Met Arg 
75 

Leu Pro Gly Val 
90 

Ser Asn Asn Val 
105 

Leu Leu Pro He 



Ala Val He Gly 

140 

Phe Leu Ser Leu 
155 

Asp Glu Leu Arg 
170 

Pro Ser Ala Asp 
185 

Phe Arg Trp Asn 



Arg Asp Asn Gly 

220 

Cys He Ala Phe 
235 

Met Thr Ser Glu 
250 

Gin Gin Ser Thr 
265 

Leu Tyr His Phe 



Val Trp He Ala 

300 

Leu Thr Glu Leu 
315 

Ser Val Pro He 
330 



Glu Tyr Glu Val 



Phe Ala Val Glu 

80 

Leu Leu Gly Tyr 
95 

Gin Pro Val Leu 
110 

Gin Glu Asp Tyr 
125 

Pro Asp Asn Ser 



Phe Leu Leu Pro 

160 

Asp Lys Val Arg 
175 

His His Val Glu 
190 

Trp He He Val 
205 

Gin Leu Leu Gly 



Gin Glu Thr Leu 

240 

Glu Arg Gin Arg 
255 

Ala Arg Val Val 
270 

Phe Asn Glu Val 
285 

Ser Glu Ser Trp 



Gly His Leu Gly 

320 

Pro Gly Phe Ser 
335 



Glu Phe Arg Glu Trp Gly Pro Gin Ala Gly Pro Pro Pro Leu Ser Arg 

340 345 350 



Thr Ser Gin Ser 
355 

Ala Thr Leu Ser 
370 

Val Tyr Ser Val 
385 

Ser Leu Leu Gly 



Pro Trp Gin Leu 

420 

Asp His Gin lie 
435 

Glu He Val Gin 
450 

Val Ala Ser Tyr 
465 

He Ser Trp His 



Lys Arg Cys Gin 

500 

Cys Cys Phe Glu 
515 

Thr Glu Asp Glu 
530 

Tyr Gin Ser Glu 
545 

Trp His Glu Ala 



Phe Leu Ser Thr 

580 

Thr Pro He Val 
595 

Thr Leu Leu Leu 
610 

Pro Lys Val Ser 
625 

Phe Thr He Cys 




Tyr Thr Cys Asn 

3 60 

Phe Asn Thr lie 
375 

Tyr Ser Ala Val 
390 

Cys Asp Lys Ser 
405 

Leu Glu Glu He 



Phe Phe Asp Pro 

440 

Trp Gin Trp Asp 
455 

Tyr Pro Leu Gin 
470 

Thr Val Asn Asn 
485 

Ser Gly Gin Lys 



Cys He Asp Cys 

520 

Tyr Glu Cys Gin 
535 

Thr Ser Cys Phe 
550 

Pro Thr He Ala 
565 

Leu Ala He Leu 



Arg Ser Ala Gly 

600 

Val Ala Tyr Met 
615 

Thr Cys Leu Cys 
630 

He Ser Cys He 
645 



25 

Gin Glu Cys Asp 



Leu Arg Leu Ser 

380 

Tyr Ala Val Ala 
395 

Thr Cys Thr Lys 
410 

Trp Lys Val Asn 
425 

Gin Gly Asp Val 



Arg Ser Gin Asn 

460 

Arg Gin Leu Lys 
475 

Thr lie Pro Met 
490 

Lys Lys Pro Val 
505 

Leu Pro Gly Thr 



Ala Cys Pro Asn 

540 

Lys Arg Gin Leu 
555 

Val Ala Leu Leu 
570 

Val He Phe Trp 
585 

Gly Pro Met Cys 



Val Val Pro Val 

620 

Arg Gin Ala Leu 
635 

Ala Val Arg Ser 
650 



Asn Cys Leu Asn 
365 

Gly Glu Arg Val 



His Ala Leu His 

400 

Arg Val Val Tyr 
415 

Phe Thr Leu Leu 
430 

Ala Leu His Leu 
445 

Pro Phe Gin Ser 



Asn He Gin Asp 

480 

Ser Met Cys Ser 
495 

Gly He His Val 
510 

Phe Leu Asn His 
525 

Asn Glu Trp Ser 



Val Phe Leu Glu 

560 

Ala Ala Leu Gly 
575 

Arg His Phe Gin 
590 

Phe Leu Met Leu 
605 

Tyr Val Gly Pro 



Phe Pro Leu Cys 

640 

Phe Gin He Val 
655 



26 



Cys Ala Phe Lys 

660 

Val Arg Tyr Gin 
675 

Lys Met Val lie 
690 

Thr Thr Arg Thr 
705 

Asn Pro Asn Tyr 



Leu Leu Ser Val 

740 

Pro Thr Asn Tyr 
755 

Tyr Phe Thr Ser 
770 

Gly Val Leu Val 
785 

Leu Ala lie Ser 



Phe Tyr Pro Glu 

820 

Gly Tyr Thr Met 
835 



Met Ala Ser Arg 



Gly Pro Tyr Val 

680 

Val Val He Gly 
695 

Asp Pro Asp Asp 
710 

Arg Asn Ser Leu 
725 

Val Gly Phe Ser 



Asn Glu Ala Lys 

760 

Ser Val Ser Leu 
775 

Thr He Val Asp 
790 

Leu Gly Tyr Phe 
805 

Arg Asn Thr Pro 



Arg Arg Asp 



Phe Pro Arg Ala 
665 

Ser Met Ala Phe 



Met Leu Ala Thr 

700 

Pro Lys He Thr 
715 

Leu Phe Asn Thr 
730 

Phe Ala Tyr Met 
745 

Phe He Thr Leu 



Cys Thr Phe Met 

780 

Leu Leu Val Thr 
795 

Gly Pro Lys Cys 
810 

Ala Tyr Phe Asn 
825 



Tyr Ser Tyr Trp 
670 

He Thr Val Leu 
685 

Gly Leu Ser Pro 



He Val Ser Cys 

720 

Ser Leu Asp Leu 
735 

Gly Lys Glu Leu 
750 

Ser Met Thr Phe 
765 

Ser Ala Tyr Ser 



Val Leu Asn Leu 

800 

Tyr Met He Leu 
815 

Ser Met He Gin 
830 



<210> 22 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
PDZIP peptide sequence 

<400> 22 

Ser Val Ser Thr Val Val 
1 5 



<210> 23 

<211> 2520 

<212> DNA 

<213> Homo sapiens 

<400> 23 

atggggccca gggcaaagac catctgctcc ctgttcttcc tcctatgggt cctggctgag 



27 



ccggctgaga actcggactt ctacctgcct ggggattacc tcctgggtgg cctcttctcc 120 
ctccatgcca acatgaaggg cattgttcac cttaacttcc tgcaggtgcc catgtgcaag 180 
gagtatgaag tgaaggtgat aggctacaac ctcatgcagg ccatgcgctt cgcggtggag 24 0 
gagatcaaca atgacagcag cctgctgcct ggtgtgctgc tgggctatga gatcgtggat 3 00 
gtgtgctaca tctccaacaa tgtccagccg gtgctctact tcctggcaca cgaggacaac 360 
ctccttccca tccaagagga ctacagtaac tacatttccc gtgtggtggc tgtcattggc 420 
cctgacaact ccgagtctgt catgactgtg gccaacttcc tctccctatt tctccttcca 480 
cagatcacct acagcgccat cagcgatgag ctgcgagaca aggtgcgctt cccggctttg 540 
ctgcgtacca cacccagcgc cgaccaccac gtcgaggcca tggtgcagct gatgctgcac 600 
ttccgctgga actggatcat tgtgctggtg agcagcgaca cctatggccg cgacaatggc 660 
cagctgcttg gcgagcgcgt ggcccggcgc gacatctgca tcgccttcca ggagacgctg 720 
cccacactgc agcccaacca gaacatgacg tcagaggagc gccagcgcct ggtgaccatt 780 
gtggacaagc tgcagcagag cacagcgcgc gtcgtggtcg tgttctcgcc cgacctgacc 840 
ctgtaccact tcttcaatga ggtgctgcgc cagaacttca cgggcgccgt gtggatcgcc 900 
tccgagtcct gggccatcga cccggtcctg cacaacctca cggagctggg ccacttgggc 960 
accttcctgg gcatcaccat ccagagcgtg cccatcccgg gcttcagtga gttccgcgag 1020 
tggggcccac aggctgggcc gccacccctc agcaggacca gccagagcta tacctgcaac 1080 
caggagtgcg acaactgcct gaacgccacc ttgtccttca acaccattct caggctctct 1140 
ggggagcgtg tcgtctacag cgtgtactct gcggtctatg ctgtggccca tgccctgcac 1200 
agcctcctcg gctgtgacaa aagcacctgc accaagaggg tggtctaccc ctggcagctg 1260 
cttgaggaga tctggaaggt caacttcact ctcctggacc accaaatctt cttcgacccg 1320 
caaggggacg tggctctgca cttggagatt gtccagtggc aatgggaccg gagccagaat 13 8 0 
cccttccaga gcgtcgcctc ctactacccc ctgcagcgac agctgaagaa catccaagac 1440 
atctcctggc acaccgtcaa caacacgatc cctatgtcca tgtgttccaa gaggtgccag 1500 
tcagggcaaa agaagaagcc tgtgggcatc cacgtctgct gcttcgagtg catcgactgc 1560 
cttcccggca ccttcctcaa ccacactgaa gatgaatatg aatgccaggc ctgcccgaat 1620 
aacgagtggt cctaccagag tgagacctcc tgcttcaagc ggcagctggt cttcctggaa 1680 
tggcatgagg cacccaccat cgctgtggcc ctgctggccg ccctgggctt cctcagcacc 1740 
ctggccatcc tggtgatatt ctggaggcac ttccagacac ccatagttcg ctcggctggg 1800 
ggccccatgt gcttcctgat gctgacactg ctgctggtgg catacatggt ggtcccggtg 1860 
tacgtggggc cgcccaaggt ctccacctgc ctctgccgcc aggccctctt tcccctctgc 1920 
ttcacaattt gcatctcctg tatcgccgtg cgttctttcc agatcgtctg cgccttcaag 1980 
atggccagcc gcttcccacg cgcctacagc tactgggtcc gctaccaggg gccctacgtc 2040 
tctatggcat ttatcacggt actcaaaatg gtcattgtgg taattggcat gctggccacg 2100 
ggcctcagtc ccaccacccg tactgacccc gatgacccca agatcacaat tgtctcctgt 2160 
aaccccaact accgcaacag cctgctgttc aacaccagcc tggacctgct gctctcagtg 2220 
gtgggtttca gcttcgccta catgggcaaa gagctgccca ccaactacaa cgaggccaag 2280 
ttcatcaccc tcagcatgac cttctatttc acctcatccg tctccctctg caccttcatg 2340 
tctgcctaca gcggggtgct ggtcaccatc gtggacctct tggtcactgt gctcaacctc 2400 
ctggccatca gcctgggcta cttcggcccc aagtgctaca tgatcctctt ctacccggag 2460 
cgcaacacgc ccgcctactt caacagcatg atccagggct acaccatgag gagggactag 2520 



<210> 24 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Six-His tag 



<400> 24 

His His His His His His 
1 5 



